Initialization method for speech separation algorithms that work in the time-frequency domain
نویسندگان
چکیده
منابع مشابه
A Study of Methods for Initialization and Permutation Alignment for Time-Frequency Domain Blind Source Separation
The problem of the blind signal separation (BSS) consists of estimating the latent component signals in a linear mixture, referred to as the sources, starting from several observed signals, without relying on any specific knowledge of the sources. In particular, when the sources are audible, this problem is known as to the cocktail-party problem, making reference to the ability of the human ear...
متن کاملQuantitative Comparisons between Time Domain Speech Fundamental Frequency Estimation Algorithms
. T W O techn iques a r e presented here t o enable q u a n t i t a t i v e comparison o f t i m e domain fundamental f requency e s t i m a t i o n a l g o r i t h m s a g a i n s t a r e fe rence , t h a t makes use o f t h e o u t p u t f rom a laryngograph. These measures a re c a r r i e d o u t on t h e p u l s a t i l e ou tpu t s produced by t h e dev ices, where each pu lse corresponds...
متن کاملBlind Signal Separation and Speech Recognition in the Frequency Domain
In this paper it is shown that a Blind Signal Separation (BSS) method in the frequency domain (FDBSS) improves significantly the speaker Signal to Interference Ratio (SIR) and the phoneme recognition score of a continuous speech, speaker-independent acoustic decoder in a multi-simultaneous-speaker office environment. Specifically, the efficiency of the presented FDBSS method is studied on a TIT...
متن کاملOn Initial Seed Selection for Frequency Domain Blind Speech Separation
In this paper we address the problem of initial seed selection for frequency domain iterative blind speech separation (BSS) algorithms. The derivation of the seeding algorithm is guided by the goal to select samples which are likely to be caused by source activity and not by noise and at the same time originate from different sources. The proposed algorithm has moderate computational complexity...
متن کاملFrequency Domain Blind Source Separation for Many Speech Signals
This paper presents a method for solving the permutation problem of frequency domain blind source separation (BSS) when the number of source signals is large, and the potential source locations are omnidirectional. We propose a combination of small and large spacing sensor pairs with various axis directions in order to obtain proper geometric information for solving the permutation problem. Exp...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Journal of the Acoustical Society of America
سال: 2010
ISSN: 0001-4966
DOI: 10.1121/1.3310248